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Description 



SYSTEM AND METHOD FOR 
DYNAMICALLY EXECUTING A 
FUNCTIONIN A PROGRAMMABLE LOGIC 
ARRAY 

Background of Invention 
[0001] FIELD OF THE INVENTION 

[0002] The present invention generally relates to the field of logic 
devices. More particularly, the present invention is di- 
rected to a system and method for dynamically executing 
a function in a programmable logic array. 

[0003] BACKGROUND OF THE INVENTION 

[0004] Recently, programmable logic devices, such as field pro- 
grammable gate arrays (FPGAs), have evolved into devices 
capable of being dynamically reconfigured on a cycle- 
by-cycle basis to perform logic functions, or portion(s) 
thereof, during each cycle. With the increasing speed of 
such reconfigurable FPGAs (RFPGAs), RFPGAs are now be- 



ing considered as components suitable for use in applica- 
tion specific integrated circuits (ASICs) and other inte- 
grated circuits (ICs). For example, ICs for applications that 
require certain data to be manipulated iteratively in order 
to produce a desired output are ICs that can benefit from 
the utilization of RFPGAs. Examples of such applications 
include communication protocol processing and image 
processing, among others. 
[0005] a benefit of using an RFPGA to perform one or more func- 
tions that would otherwise be performed by fixed logic is 
often a reduction in the amount of chip area needed to 
perform these functions. For example, a certain function 
may be partitioned into ten functional blocks that are 
chained together such that the output of one block be- 
comes the input to the next block, and so on. Using con- 
ventional fixed logic, all ten of these functional blocks 
would typically be placed on the chip next to one another 
in their proper data-flow sequence. Using an RFPGA solu- 
tion, however, these ten functional blocks can be replaced 
with a single RFPGA that is cyclically reconfigured to pro- 
cess each of the ten functional blocks in the proper se- 
quence. 

[0006] Generally, the processing of the ten functional block func- 



tion proceeds as follows. The RFPGA is configured for the 
first functional block and the input data of the function is 
passed into the RFPGA, i.e., the first functional block. The 
output of the first functional block then passes out of the 
block and is stored in certain storage elements created 
within the RFPGA. These storage elements store the out- 
put of the first functional block for use as the input to the 
second functional block of the function, which is created 
by reconfiguring portions of the RFPGA other than storage 
elements. At the same time portions of the RFPGA are be- 
ing reconfigured for the second functional block, other 
portions of the RFPGA are reconfigured as storage ele- 
ments for storing the output of the second functional 
block, which is also the input for the third functional 
block. This process is repeated for the third through the 
ninth functional blocks. The process for the tenth, and fi- 
nal, functional block is slightly different, with the output 
of the tenth functional block being passed to the external 
I/O elements of the RFPGA, where the output is passed to 
other portions of the IC. 
[0007] while processing multiple functional blocks in series using 
an RFPGA is conceptually straightforward, conventional 
RFPGA reconfiguration techniques make it difficult to op- 



timize each of the various configurations of the RFPGA 
needed to perform a given function. However, optimizing 
IC designs is important. This difficulty in optimizing con- 
figurations of an RFPGA is largely due to the way in which 
conventional techniques handle the data being passed 
from one configuration of logic, i.e., functional block, to 
the next configuration of logic. 
[0008] FIG. 1 illustrates the configuration of an RFPGA 20, re- 
spectively, when configured for the Mth functional block 
of a function consisting of three or more functional 
blocks. As a convention, the current functional block, i.e., 
the functional block configured in RFPGA 20 at a given 
time, will be referred to as the Mth functional block. Thus, 
if the Mth functional block is the second functional block, 
the first functional block is referred to as the "M-l" func- 
tional block, the third functional block is the 
"M+rfunctional block, the fourth functional block is the 
"M+2" functional block, and so on. Each functional block 
other than the first and last functional block will have as- 
sociated therewith "M-l"storage elements, i.e., the out- 
puts of the "M-l" functional block, which are the inputs to 
the Mth functional block, and "M"storage elements, which 
contain the outputs of the Mth functional block. The first 



functional block does not need any "M-l" storage ele- 
ments since the input to the first functional block comes 
from circuitry (not shown) located outside RFPGA 20 via 1/ 
O elements 24, and the last functional block does not 
need any "M" storage elements since the output of the last 
functional block is passed out of the RFPGA. The same 
nomenclature is also to be applied to the cycling of the 
functional blocks from one to the next as reconfiguration 
and processing proceed. For example, if the reference cy- 
cle is denoted the "M"cycle, the immediately prior cycle is 
the "M-l" cycle and the next cycle is the "M+l"cycle. 
[0009] RFPGA 20 generally includes an array of configurable logic 
blocks (CLBs) 28, or other (re)programmable logic ele- 
ments, interconnected to one another via interconnect re- 
sources 32. CLBs 28 are the reconfigurable functional ele- 
ments, e.g., logic gates, used to implement the logic of 
each functional block of the desired function and the 
storage elements needed to store data being passed from 
one functional block to the next. Each CLB 28 pro- 
grammed as a functional element is shown as a square 
containing an "F,"each CLB programmed as a correspond- 
ing "M-l"storage element is shown as a square containing 
an "M-l"and each CLB programmed as a corresponding 



"M"storage element is shown as a square containing an 
"M." RFPGA 20 is in electrical communication with I/O 
cells 24, which provide communication between the RF- 
PGA and other circuits (not shown), e.g., memory, proces- 
sor, communication or other circuits. 

[0010] As shown in FIG. 1, conventional RFPGA programming 

techniques result in storage elements M-l, M of the first 
functional block being scattered across the array of CLBs 
28 at locations that locally optimize the logic paths 
through functional elements. The problem with this ap- 
proach begins to appear, however, when a programmer 
36 responsible for reconfiguring (programming) RFPGA 20 
with each set of functional block and storage elements 
goes to reprogram the RFPGA with the second (and sub- 
sequent) functional blocks and corresponding storage lo- 
cations. To reprogram RFPGA 20, programmer 36 must 
take into account the locations of scattered storage ele- 
ments M so as to not interfere with the data stored there, 
which is the input to the next functional block. The pro- 
cess of optimizing a functional block and its storage ele- 
ments is hampered by the scattered nature of the storage 
elements from the previous functional block. 

[0011] The scattered nature of storage elements M-l, M also de- 



grades the overall performance and cycle time of the 
function being processed by RFPGA 20. The length of de- 
lay from a storage element to the worst-case entry point 
into the next functional block becomes the limiting factor 
in the cycle time of RFPGA 20. Moreover, depending upon 
the path delay per cycle, the cycle time would vary from 
one cycle to the next. This could cause the worst cycle 
frequency to determine the overall processing speed of 
the function. This problem is directly dependent on the 
placement of the storage elements to determine the oper- 
ating frequency of the function. 
[0012] The ability of the programmer to place storage elements 
at appropriate locations to facilitate fast cycle times be- 
comes dependent on not just the current functional block 
configuration requirements, but also on all subsequent 
functional block configuration requirements. That is, the 
locations of storage elements for the current functional 
block configuration affects the locations of the storage el- 
ements for the next functional block configuration, which 
affects the locations of storage elements for the following 
functional block configuration, and so on. This creates a 
combinatorial explosion of possible options for which the 
programmer must find a maximum cycle time for each cy- 



cle and an overall cycle time. For complex functional 
blocks, conventional RFPCA programmers can take several 
hours to produce a viable solution for one functional 
block configuration or cycle. It is readily seen that achiev- 
ing a viable solution for multiple functional block configu- 
rations can require a prohibitive amount of time. A need 
therefore exists for configuring functional blocks in an ef- 
ficient manner. 
Summary of Invention 

[0013] | n one aspect, the present invention is directed to a sys- 
tem for processing a logic function comprising a plurality 
of functional blocks each having output data. The system 
comprises a first storage region, a second storage region 
and a reconfigurable logic array having defined therein a 
function region located outside the first and second stor- 
age regions. A programmer is in communication with the 
reconfigurable logic array and operatively configured to 
program the function region with ones of the plurality of 
functional blocks on a cyclical basis such that the output 
of successive ones of the plurality of functional blocks is 
stored in the first and second storage regions on an alter- 
nating basis. 

[0014] ^ another aspect, the present invention is directed to a 



method of processing a function partitioned into a plural- 
ity of functional blocks. The method comprises the step of 
programming a reconfigurable logic array with a first one 
of the plurality of functional blocks so that the first one of 
the plurality of functional blocks processes in a first di- 
rection. Then, the first one of the plurality of functional 
blocks is processed in the first direction. The reconfig- 
urable logic array is programmed with a second one of the 
plurality of functional blocks so that the second one of the 
plurality functional blocks processes in a second direction 
substantially opposite the first direction. The second one 
of the plurality of functional blocks is processed in the 
second direction. 
Brief Description of Drawings 

[0015] For the purpose of illustrating the invention, the drawings 
show a form of the invention that is presently preferred. 
However, it should be understood that the present inven- 
tion is not limited to the precise arrangements and instru- 
mentalities shown in the drawings, wherein: 

[0016] FIG. 1 is a schematic diagram of a reconfigurable field 
programmable gate array (RFPCA) configured in accor- 
dance with the prior art for one functional block of a 
function having a plurality of functional blocks; 



[0017] FIG. 2 is a schematic diagram of an integrated circuit chip 
containing a reconfigurable logic array (RLA) system of the 
present invention; 

[0018] FIG. 3A is a high-level schematic diagram of a partitioned 
function that may be processed by the RLA system of FIG. 
2; FIGS. 3B-3D are high-level schematic diagrams of the 
RLA system of FIG. 2 wherein the RLA is configured, re- 
spectively, for processing the first, second and third func- 
tional blocks of the function illustrated in FIG. 3A; 

[0019] FIG. 4 is a timing diagram illustrating the cyclical pro- 
gramming of the RLA of FIGS. 3B-3D for processing the 
function of FIG. 3A; 

[0020] FIG. 5 a high-level schematic diagram of an RLA system of 
the present invention wherein some of the storage regions 
are not located at the edge of the RLA and functional 
blocks of multiple functions are programmed and pro- 
cessed simultaneously with one another; 

[0021] FIG. 6 is a high-level schematic diagram of an RLA system 
of the present invention wherein two functions share the 
same storage regions and are processed simultaneously in 
opposite directions from one another; 

[0022] FIG. 7A is a high-level schematic diagram of an RLA sys- 
tem of the present invention wherein the RLA contains two 



function regions so that one function region can be pro- 
grammed while the other function region is processing; 
FIG. 7B is a timing diagram for processing a function us- 
ing the dual-functional-region structure of FIG. 7A; and 
[0023] FIG. 8 is a high-level schematic diagram of an RLA system 
of the present invention wherein the storage elements are 
fixed memories. 
Detailed Description 

[0024] Referring now to the drawings, FIG. 2 shows in accordance 
with the present invention an integrated circuit (IC) chip, 
which is identified generally by the numeral 100. Chip 100 
may be any type of IC chip, such as a logic chip, an appli- 
cation specific IC (ASIC) chip, or microprocessor chip, 
among others. Chip 100 contains at least one reconfig- 
urable logic array (RLA) system 104 that includes an RLA 
108 and a programmer 112 for reconfiguring the logic 
structure of the RLA. RLA 108 may be any suitable logic 
array, e.g., a field programmable gate array (FPGA), that 
may be cyclically reconfigured to implement any logic 
cone having a size that can be accommodated by the RLA. 
Although not illustrated, RLA 108 may include a plurality 
of reprogrammable logic elements (RLEs) that may be the 
same as or generally similar to CLBs 28 of the prior art 



RFPGA 20 illustrated in FIG. 1. As is well known in the art, 
the RLEs generally are the elements of RLA 108 that are 
reconfigurable to function as various logic gates or logic 
blocks, depending upon the granularity of the RLA. Pro- 
grammer 112 is in electrical communication with the RLEs 
via configuration circuitry 116. As discussed below in de- 
tail, the function of programmer 112 is to reconfigure the 
RLEs on a periodic basis to configure RLA 108 so that it 
contains one or more particular logic function(s), or por- 
tion^) thereof, and corresponding storage elements, if 
needed. 

[0025] As discussed in the background section above, a logic 
function that is larger than the size that a particular RLA 
can process in a single configuration cycle can be parti- 
tioned into a number of functional blocks each having a 
size small enough to be processed in one cycle of the RLA. 
In contrast to a conventional programmer, e.g., program- 
mer 36 of FIG. 1, that scatters storage elements through- 
out the logic array, as discussed below in detail, program- 
mer 112 of the present invention essentially configures 
within RLA 108 (among the RLEs) at least a function region 
and, typically except for the last one of the functional 
blocks, a corresponding storage region for storing and 



holding output from the function region. 

[0026] Referring to FIG. 3A, consider a function F partitioned into 
three functional blocks FBI, FB2, FB3. Of course, those 
skilled in the art will appreciate that the number of func- 
tional blocks is dependent upon the size of the function 
and the size of the region of the RLA devoted to functional 
logic and that the number of functional blocks may be any 
integer greater than one. However, for simplicity, the 
present explanation is limited to three functional blocks 
FBI, FB2, FB3. Those skilled in the art will readily under- 
stand how to make the changes necessary to implement 
any other number of functional blocks. 

[0027] As illustrated by FIG. 3B, during the first configuration cy- 
cle, i.e., cycle M, programmer 112 configures function re- 
gion FR with functional block FBI and a corresponding 
storage region SRI for receiving the output of FBI, leaving 
a region R of RLA 108 outside of the function and storage 
regions unconfigured (in terms of logic or storage config- 
uration) for being configured in the next configuration cy- 
cle, cycle M+l, as storage region SR2 (FIG. 3C). (It is noted 
that in other embodiments, storage region SR2 of FIG. 3C 
could be configured during cycle M, if desired.) Input data, 
represented by arrows 120, is then passed into RLA 108 



and processed by functional block FBI. The output data of 
functional block FBI, represented by arrows 124, passes 
from functional block FBI to storage elements (not shown) 
within storage region SRI. 
[0028] on the second configuration cycle (FIG. 3C), cycle M+l, 
programmer 112 reconfigures function region FR with 
functional block FB2 and configures storage region SR2 
for receiving the output of functional block FB2. Program- 
mer 112 essentially leaves storage region SRI alone, ex- 
cept for making the appropriate connections between the 
storage cells therein and functional block FB2, so as to not 
disturb the output data of functional block FBI, which is 
the input data for functional block FB2. Once programmer 
112 has configured function region FR and storage region 
SR2, RLA 108 enters into a processing cycle during which 
the data stored in storage region SRI passes into (arrows 
128) functional block FB2 and corresponding output data, 
represented by arrows 132, passes from functional block 
FB2 into storage elements (not shown) within storage re- 
gion SR2. 

[0029] on the third configuration cycle (FIG. 3D), cycle M+2, pro- 
grammer 112 reconfigures function region FR with func- 
tional block FB3 and any connections needed with the 



storage elements within storage region SR2 so that the 
output data of FB2 of the previous cycle can be input into 
FB3. Since function F under consideration has only three 
functional blocks FBI, FB2, FB3, programmer 112 need 
not reconfigure storage elements within storage region 
SRI, since the output of functional block FB3 can be 
passed directly out of RLA 108. Once programmer 112 
has reconfigured function region FR, RLA 108 enters into 
an execution cycle during which the data stored in storage 
region SR2 passes into (arrows 136) functional block FB3 
and corresponding output data, represented by arrows 
140, passes from functional block FB3 out of the RLA. 
[0030] jo maximize the speed at which programmer 112 can 
configure function region FR with functional blocks FBI, 
FB2, FB3, each configuration of the function region, i.e., 
the configuration for functional block FBI, the configura- 
tion for functional block FB2 and the configuration for 
functional block FB3, may be "precompiled" and stored in 
a memory (not shown). In this manner, programmer 112 
simply "loads" function region with the appropriate pre- 
compiled configuration during each cycle. Alternatively, 
programmer 112 may be configured to determine the 
necessary configuration for function region FR during 



each cycle. Factors to be considered in determining which 
of these alternatives is more appropriate for a particular 
design include the cycle time, size of function array FR 
and amount of memory available for storing the precom- 
piled configurations. Of course, it may also be desirable 
for programmer 112 to implement a hybrid of these alter- 
natives wherein certain logic segments used over and over 
again are precompiled but the programmer determines 
the overall configuration for each functional block FBI, 
FB2, FB3 during each cycle. 
[0031] Those skilled in the art will readily understand how the 
multi-cycle reconfiguration and processing method just 
described can be generalized to a function having any 
number of functional blocks. For a function having more 
than three functional blocks, there will obviously be more 
configuration and processing cycles during which the ad- 
ditional functional block(s) are configured and data is pro- 
cessed through the additional functional blocks. With the 
configuration of function region FR and storage regions 
SRI, SR2 shown in FIGS. 3B-3D, the odd-numbered func- 
tional blocks, e.g., functional blocks FBI and FB3, will be 
executed from left to right, whereas the even-numbered 
functional blocks, e.g., FB2, will be executed from right to 



left. This orderly process of configuring and executing the 
various functional blocks of a given function within func- 
tion region FR and designating, using and reusing storage 
regions SRI, SR2 as needed, allow the processing of func- 
tion F to be readily optimized. 
[0032] Although the configuration and execution of functional 

blocks, e.g., functional blocks FBI, FB2, FB3, has been de- 
scribed in the context of left-to-right, right-to-left exe- 
cution, storage regions SRI, SR2 can be located anywhere 
relative to function region FR. For example, storage re- 
gions SRI, SR2 can be reversed so that execution of the 
functional blocks proceeds right-to-left and then left- 
to-right and so on, or storage regions SRI, SR2 may be 
located above and below function region FR so that pro- 
cessing of the functional blocks proceeds either top- 
to-bottom, bottom-to-top or bottom-to-top, top- 
to-bottom. Other processing configurations are also pos- 
sible, such as left-to-top, top-to-left, but may not be de- 
sirable due to unacceptable delays in the longer logic 
paths through the corresponding functional block. The 
processing direction of the functional block will generally 
depend upon the design of the particular RLA utilized. 
Conventional RLEs are typically relatively asymmetric in 



terms of processing efficiency, i.e., the processing speed 
in one direction is much faster than the processing speed 
in the opposite direction. However, RLEs may be designed 
to have processing symmetry in two or four directions, 
e.g., left-to-right, right-to-left or top-to-bottom, bot- 
tom-to-top or both. 
[0033] Programmer 112 may run any software 144 suitable for 
periodically configuring function region FR with the func- 
tional blocks, e.g., functional blocks FBI, FB2, FB3. Soft- 
ware 144 may utilize existing software modules/routines 
to perform the low level tasks necessary to configure 
function region FR and storage regions SRI, SR2. Such 
modules/routines are well known in the art. At a higher 
level, new software modules/routines must be written for 
software 144 in order to implement the function process- 
ing scheme of the present invention, whereby distinct, co- 
herent function region FR and storage regions SRI, SR2 
are designated and reconfigured as needed during peri- 
odic configuration cycles. With the present disclosure as a 
guide for generating these high-level routines, those 
skilled in the art will readily be able to write software 144 
necessary for programmer 112 to provide the functional- 
ity described herein. 



[0034] | n order to optimize a particular design of RLA system 

104, it will often be desired to place storage regions SRI, 
SR2 as close to corresponding edges of RLA 108 and also 
to maximize the size of function region FR and minimize 
the sizes of the storage regions. When this is done, a 
method of determining the sizes (in terms of RLEs) of 
function region FR and storage regions SRI, SR2 when 
function contains X processing logic elements and RLA 
contains Y programmable logic elements, may proceed as 
follows. First, the maximum number of storage elements 
SE needed for storage regions SRI, SR2 is subtracted from 
the Y logic elements of RLA 108 to yield a maximum 
number Z of logic elements that can be implemented per 
cycle. (The maximum number of storage elements can be 
approximated or determined by the maximum number of 
data paths.) 

[0035] Then, the X logic elements of function F may be divided 

by the Z function logic elements to determine the number 
W of functional blocks into which the function will be par- 
titioned (which also equals the number of cycles for con- 
figuring and processing the function). Of course, W must 
be an integer, such that any remainder must be rounded 
to the next highest integer. For each cycle, the inputs and 



outputs are defined. The maximum number of outputs for 
all W cycles can then be compared with the number SE of 
storage elements used to determine the number Z of pro- 
grammable logic elements. This can be an iterative pro- 
cess with possible trade-offs for smaller Z values to 
achieve a smaller maximum number of storage elements 
needed. The configuration of the Z logic elements for each 
of the W cycles may then be optimized and configured for 
the alternating directional flow of data discussed above in 
connection with FIGS. 3B-3D. The processing time to 
complete the partitioning of function F and the per-cycle 
optimization, placement and wiring becomes M cycles 
rather than the combinatorial explosion discussed in the 
background section above relative to conventional multi- 
cycle RLA processing. 
[0036] Referring to FIG. 4, and also to FIGS. 3A-3D, FIG. 4 illus- 
trates a timing diagram 200 for the three-cycle process- 
ing of function F as comprising a plurality of timing seg- 
ments T -T . During timing segment T , programmer 112 
programs function region FR of RLA 108 with functional 
block FBI and storage region SRI with storage elements. 
Once programmer 112 has programmed RLA 108 as re- 
quired, data may be launched during timing segment T 



into functional block FBI for processing. During timing 
segment T , functional block FBI processes the launched 
data and outputs data to storage region SRI. Timing seg- 
ments T through may be considered cycle 1 of RLA 
108 relative to processing of function F, since all pro- 
gramming and processing relating to functional block FBI 
occurs in these timing segments. 
[0037] During timing segment T , programmer 112 programs 

4 

function region FR with functional block FB2 and pro- 
grams storage region SR2 with the necessary storage ele- 
ments. Also during timing segment T , data in storage re- 

4 

gion SRI output from functional block FBI is held in state 
within storage region SRI while programmer 112 pro- 
grams function region FR and storage region SR2. During 
timing segment T , data from SRI is processed by func- 
tional block FB2, and the output of functional block FB2 is 
stored in SR2. Since the programming and processing re- 
lating to functional block FB2 are performed during timing 
segments T and T , these two timing segments may be 

4 5 

considered as forming cycle 2 relative to processing of 
function F. 

[0038] During timing segment T , programmer 112 programs 

6 

function region FR with functional block FB3. Also during 



timing segment T , data in storage region SR2 output 

6 

from functional block FB2 is held in state within storage 
region SR2 while programmer 112 programs function re- 
gion FR. During timing segment T , data from SR2 is pro- 
cessed by functional block FB3, and the output of func- 
tional block FB2 is passed out of RLA 108. Since the pro- 
gramming and processing relating to functional block FB3 
are performed during timing segments T and T , these 

6 7 

two timing segments may be considered as forming cycle 
3 relative to processing of function F. Again, while this 
example has been limited to function F having three func- 
tional blocks FBI, FB2, FB3 and, consequently, requiring 
three cycles 1, 2, 3, those skilled in the art will readily ap- 
preciate that the present invention can be modified to 
handle functions having any number of functional blocks 
and corresponding cycles. 
[0039] whereas RLA system 104 of the present invention dis- 
cussed above in connection with FIGS. 3 and 4 is generally 
shown and described as having storage regions SRI, SR2 
located proximate edges of RLA 108, this need not be the 
case. FIG. 5 illustrates an RLA system 300 of the present 
invention wherein some of the storage regions SRI , SR2 

n n 

are not located adjacent edges of RLA 304. In this exam- 



pie, programmer 308 may program RLA 304 for process- 
ing multiple functional blocks each corresponding to a 
different function, e.g., functions F , F , F 3 (not shown), 
simultaneously with one another. Similar to function F 
discussed above in connection with FIGS. 3 and 4, each 
function F , F , F may have associated therewith a corre- 

l' 2' 3 7 

sponding function region FR and corresponding storage 

n 

regions SRI , SR2 that may have similar structure and 

n n 

functionality to the structure and functionality described 
above in connection with function region FR and storage 
regions SRI, SR2 relative to FIGS. 3 and 4. Accordingly, 
during each cycle, programmer 308 would program each 
of function regions FR^ FR 2 , FR 3 with the corresponding 
functional blocks of functions F^ F , respectively, and 
these functional blocks would be processed and data 
therefrom stored in the corresponding appropriate stor- 
age region SRI , SR2 or passed out of RLA 304, as the 

n n 

case may be. 

[0040] it is noted that to optimize the performance of RLA 304, it 
would be advantageous that the RLA be designed to pro- 
cess symmetrically from left-to-right and right-to-left as 
well as top-to-bottom and bottom-to-top. Of course, RLA 
304 may be programmed in a configuration other than the 



quad-directional configuration shown. For example, if RLA 
304 were designed for top-to-bottom and symmetrical 
bottom-to-top processing, it may be desirable to process 
all three of functions F , F , F 3 vertically. Similarly, if RLA 
304 and its RLEs were designed for left-to-right and sym- 
metrical right-to-left processing, it may be desirable to 
process all three of functions F^ horizontally. Of 
course, even if RLA 304 is not symmetric in terms of pro- 
cessing efficiency in any direction, the functional blocks of 
functions F^ may nevertheless be cycled simultane- 
ously with one another, but perhaps with delay caused by 
slower processing in the less efficient direction(s). 
[0041] FIG. 6 illustrates another RLA system 400 of the present 
invention wherein RLA 404 processes two functions, e.g., 
functions OLE_LINK23F , F OLE_LINK23simultaneously 

l' 2 1 

with one another, but in opposite directions. The design 
of RLA system 400 recognizes the fact that when an RLA, 
such as RLA 108 of FIGS. 3B-3D, processes only one func- 
tion, data being "passed" from one functional block M to 
the next functional block M+l must be held in one or the 
other of storage regions SRI, SR2 (FIG. 3C). Thus, the RLEs 
being used to hold the data during reconfiguration of 
function region FR from functional block M to functional 



block M+l are not available for use during the processing 
of functional block M + l. In general, RLA system 400 of 
FIG. 6 utilizes these RLEs during each processing cycle, as 
opposed to every other processing cycle as discussed 
above relative to FIGS. 3-5, by processing two functions F 
i> F 2 simultaneously in two different directions. Thus, dur- 
ing each processing cycle, the storage elements (not 
shown) of storage regions SRI', SR2' would be clocked 
with new data each processing cycle. It is noted that this 
bi-directional processing would typically not be optimized 
for each processing cycle, but rather would need to be 
optimized across two consecutive cycles, e.g., cycles M, 
M + l. 

[0042] FIG. 7 A illustrates a further RLA system 500 of the present 
invention wherein programmer 504 configures RLA 508 to 
contain two function regions FR1', FR2' for processing 
successive functional blocks of a function having a plural- 
ity of functional blocks, e.g., functional blocks M through 
M + n. Referring back to timing diagram 200 of FIG. 4, it is 
seen that each of cycles 1, 2 and 3 includes a program- 
ming step in timing segments T , T and T , respectively, 

14 6 

followed by a processing step in timing segments T , T 5 
and T , respectively. Accordingly, during each of these 



timing segments, or cycles, RLA system 104 of FIGS. 
3B-3D is either in a programming state or in a processing 
state. With the two function regions FR1', FR2' of RLA sys- 
tem 500, however, programmer 504 can be configured to 
program one of the function regions while the other func- 
tion region is processing the corresponding functional 
blocks. 

[0043] Programmer 504 is also configured for setting up and re- 
programming storage regions SRI", SR2". For functional 
blocks other than the very first and the very last for which 
data may be passed directly into and directly out of, re- 
spectively, the functional block without needing to be held 
in one of storage regions SRI", SR2", first portions Pl^, P2 
of the storage elements (not shown) of respective storage 
regions SRI", SR2" must hold data generated by the cur- 
rent functional block M in one of the function regions FR1, 
FR2 while second portions P2 i , P2 2 of the storage ele- 
ments of the respective storage regions are being recon- 
figured to receive data from functional block M+2. 

[0044] FIG. 7B illustrates a timing diagram 600 for RLA system 
500 of FIG. 7A. At the beginning of the processing of a 
function (not shown) having a number of functional 
blocks, processing of the function may begin at timing 



segment T ' wherein programmer 504 programs function 
region FR1' with functional block M and portions Pl^ and 
P2 i of storage region SRI" with, respectively, storage ele- 
ments for functional block M and functional block M+2. 
At timing segment data is launched into functional 
block M. Functional block M is processed during timing 
segment T '. Simultaneously, programmer 504 programs 
function region FR2" with functional block M + l and por- 
tions Pl 2 and P2 2 of storage region SR2" with, respec- 
tively, storage elements for functional block M+l and 
functional block M+3. During timing segment!"^, func- 
tional block M + l is processed such that the output of 
functional block M+l is stored in portion Pl 2 of storage 
region SR2". Simultaneously with the processing of func- 
tional block M + l, programmer 504 reconfigures function 
region FR1' with functional block M+2 and reconfigures 
portion Pl i of storage region SRI" with storage elements 
for functional block M+4. During the following timing 
segment "ly, functional block M+2 is processed such that 
the output of functional block M+2 is stored in portion P2 
of storage region SRI". Simultaneously with the process- 
ing of functional block M+2, programmer 504 reconfig- 
ures function region FR2 1 with functional block M+3 and 



reconfigures portion Pl^ of storage region SR2' with stor- 
age elements for functional block M+5. This alternating 
programming of function regions FR1\ FR2' with succes- 
sive functional blocks and the corresponding program- 
ming of storage elements for functional blocks two blocks 
ahead proceeds until functional block M+n-2 is reached. 
At that time, storage elements for functional block M+n 
need not be programmed because the outputs of func- 
tional block M + n may be passed directly out of RLA 508, 
if desired. However, functional blocks M + n-1 and M+n 
must still be programmed into function regions SRI", 
SR2" for subsequent execution. Those skilled in the art 
will appreciate that timing diagrams other than timing di- 
agram 600 are possible. 
[0045] FIG. 8 shows yet another RLA system 900 of the present 
invention. Similar to the RLA systems 104, 300, 400, 500 
discussed above relative to FIGS. 3-7, RLA system 900 in- 
cludes an RLA 904 and a programmer 908 for reconfigur- 
ing the RLA on a cyclical basis. However, instead of pro- 
grammer 908 programming RLA 904 to contain various 
storage regions, e.g. storage regions SRI, SR2 of FIGS. 
3B-3C, RLA system 900 utilizes fixed memories FM1 and 
FM2 located adjacent the RLA. In general, RLA 904 can be 



programmed to function in a manner very similar to RLA 
systems 104, 300, 400, 500 discussed above in connec- 
tion with FIGS. 3-7, the primary difference being that pro- 
grammer 908 does not configure fixed memories FM1, 
FM2 in the manner discussed relative to the other RLA 
systems since the fixed memories do not contain RLEs, 
but rather fixed storage elements. Fixed memories FM1, 
FM2 may be of any suitable type, such as SRAM, register 
array or flip-flop, among others. Those skilled in the art 
will readily understand how to interface fixed memories 
FM1, FM2 with RLA 904, such that a detailed explanation 
need not be provided herein for those skilled in the art to 
implement RLA system 900. 
[0046] while the present invention has been described in con- 
nection with a preferred embodiment, it will be under- 
stood that it is not so limited. On the contrary, it is in- 
tended to cover all alternatives, modifications and equiva- 
lents as may be included within the spirit and scope of the 
invention as defined above and in the claims appended 
hereto. 



